Probability distributions for polymer translocation 
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We study the passage (translocation) of a self-avoiding polymer through a membrane pore in 
two dimensions. In particular, we numerically measure the probability distribution Q(T) of the 
translocation time T, and the distribution P(s,t) of the translocation coordinate s at various times 
t. When scaled with the mean translocation time (T), Q(T) becomes independent of polymer length, 
and decays exponentially for large T. The probability P(s,t) is well described by a Gaussian at 
short times, with a variance that grows sub-diffusively as t a with a ~ 0.8. For times exceeding (T), 
P(s,t) of the polymers that have not yet finished their translocation has a non-trivial stable shape. 
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I. INTRODUCTION 

Translocation of a long polymer through a narrow 
pore in a membrane has been extensively studied ex- 
perimentally during the last decade P, [H, It is im- 
portant in many biological and chemical processes such 
as viral injection of DNA into a host, RNA transport 
through nanopore of the nuclear membrane It may 
also have practical applications such as the possibility to 
"read" a DNA or RNA sequence by passing it through 
a nanopore such as microfabricated channels or the a- 
hemolysin channel Q. Understanding the dynamics of 
translocation is also of inherent fundamental interest. 
Theoretically, the short time behavior has been inves- 
tigated 

by considering ever more detailed models of 
the interaction between the polymer and the pore in the 
membrane. The microscopic details should not be nec- 
essary to understand the scaling of the passage time for 
very long polymers, where it should suffice to resort to 
rather simple models of the polymer and the membrane. 

It is convenient to track the process with a single vari- 
able s, called the translocation coordinate, that is the 
monomer number at the pore @, H, H, EH , and also indi- 
cates how much of the polymer has passed to the other 
side. In terms of this variable, the translocation process 
begins when the first monomer enters the pore (s = 1) 
and ends when the last monomer exits to the other side at 
s = N, at the translocation time T. If the translocation 
process is sufficiently slow (to allow for the equilibration 
of the polymer), the mean force acting on the monomer 
at the hole can be obtained from a simple calculation 
of free-energies. This is a reasonable approximation for 
the experimental results of relatively short polymers Q . 
The reduction of entropy creates a weak potential barrier, 
and the translocation problem becomes equivalent to the 
escape of a 'particle' (the translocation coordinate) over 
this barrier. As is the case for diffusion over an interval of 
length TV, the mean translocation time is found to scale 
as (T) ~ TV 2 . (The logarithmic potential due to entropy 
is too weak to modify this scaling.) 

In the absence of hydrodynamic interactions, the re- 



laxation time r of a polymer scales with the number 
of monomers as N 1+2v [ll], E3, where the exponent v 
characterizes scaling of the radius of gyration R g of the 
polymer by R g ~ N v . In good solvent v = 3/4 in two 
dimensions (2D), and v w 0.59 in three dimensions (3D). 
Note that r is of the order of the time the polymer needs 
to diffuse its own R g . Since r grows faster than N 2 , the 
quasi-equilibrium approach to translocation, described in 
the previous paragraph, must fail for sufficiently large 
N. The relaxation process slows down the passage of the 
polymer, and the stochastic forces acting on the translo- 
cation coordinate must be anti-correlated. Initial nu- 
merical simulations suggest fl3| that the resulting mean 
translocation time (T) scales like the relaxation time r, 
i.e. (T) ~ N 1+2u . This suggests that, to the extent 
that the resulting process can be regarded as station- 
ary, the translocation coordinate s executes anomalous 
(subdiffusive) motion. Simple scaling considerations lead 
to the conclusion [l3j], that the variance of the translo- 
cation coordinate Ss 2 increases with time t as t a with 
a = 2/(1 + 1v). This power is obtained by the require- 
ment that for t — (T), Ss 2 ~ TV 2 , i.e. the translocation 
is complete. 

Anomalous diffusion in translocation is closely related 
to the behavior of a tagged monomer in a long poly- 
mer 14|: At short time scales, before a monomer feels 
the effects of its neighbors, it undergoes rapid normal 
diffusion with diffusion constant D a . At very long times, 
exceeding the relaxation time r, the entire polymer (and 
hence each monomer) diffuses along with the center of 
mass of the polymer, with a slow diffusion constant of 
D Q /N. At intermediate times, the fluctuations of the 
monomer are independent of the total length, and to 
match the final A-dependent diffusion constant must 
be characterized by anomalous diffusion 14]. Both the 
tagged monomer and the translocation coordinate are 
slowed down by the couplings to the rest of the poly- 
mer, and undergo subdiffusion since the variances of the 
relevant variables increase sublinearly (a < 1). There are 
of course important differences: the diffusion of a tagged 
monomer is executed in d-dimensional real space, while 
the translocation coordinate moves along the one dimen- 
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sional axis of monomer numbers. The scaling exponents 
are also different, although their values are derived from 
closely related considerations. Recently we studied the 
constrained motion of a tagged monomer as an indirect 
means of gai ning insight into the distribution of translo- 
cation times [fa ]. 

There has been much recent progress in the theoret- 
ical modelling of translocation: hydrodynamic interac- 
tions were taken into account [Hil Il7j , and an intuitive 
scaling picture of polymer translocation under the influ- 
ence of a force was developed. A variety of scaling 
regimes with force applied to the end-point or at the por e 
have been investigated numerically in some detail [2fJ. 
Some recent studies [U [22[ suggest that the transloca- 
tion process in 3D maybe even slower than dictated by 
the relaxation time. If so, this would weaken the anal- 
ogy between translocation and the motion of a tagged 
monomer. (The accuracy of these claims is questioned in 
further work (23J.) 

Since the translocation process is terminated when s 
reaches one end of the polymer, one may draw an anal- 
ogy to the anomalous diffuser in the presence of absorb- 
ing boundaries. One approach frequently used to de- 
scribe subdiffusion is the fractional diffusion equation 
(FDE) [H]. Solutions of FDE in the presence of ab- 
sorbing boundaries predict that for large t the absorp- 
tion probability Q(t) decays as l/t a+1 [25|]. In the case 
of subdiffusion (a < 1) this decay is so slow that the 
mean absorbtion time diverges. By applying this anal- 
ogy to translocation it has been suggested [26[ that the 
mean translocation time is infinite, and there is a nu- 
merical study [22j lends support for a power-law tail in 
the distribution of translocation times. If so, this would 
imply that the experimentally and numerically measured 
translocation times are artifacts of the finite duration of 
the experiment. However, this proposition is not sup- 
ported by experiments or other numerical simulations. 

To address this controversy, we recently considered [l5[ 
the motion of a tagged monomer belonging to a very long 
phantom (Gaussian) polymer, moving in one-dimension 
between two absorbing boundaries. We demonstrated 
that at least in this case, Q(t) decays exponentially even 
though the monomer undergoes subdiffusion. While this 
casts strong doubts to the generality and relevance of 
the conclusions based on FDE, it does not directly ad- 
dress the dynamics of translocation, and thus not nec- 
essarily contradict the conclusions of Ref. [2^]. Thus, 
in this work, we perform a direct and detailed study of 
the translocation process for a self-avoiding polymer in 
2D. We concentrate on the behavior of the distribution 
of the translocation times, and also on the stationary 
distribution of the translocation coordinate at very long 
times. In Sec. [TT] we describe our numerical model and 
the Monte Carlo (MC) procedure. The results presented 
in Sec. IIIII demonstrate that for large t the distribution 
of the translocation times decays exponentially, while 
the long-time distribution of the translocation coordinate 
takes a non-trivial form. 



II. THE MODEL AND SIMULATIONS 

Simulations of the self-avoiding polymer translocating 
through a membrane were performed with a fluctuating 
bond polymer model [13] in 2D. In this model, the N 
monomers are restricted to the sites of a square lattice. 
Excluded volume interactions are implemented by forbid- 
ding two monomers to be closer than 2 lattice constants, 
while the polymeric character is enforced by requiring the 
separation between monomers adjacent along the chain 
to be less than \/l0 lattice constants. This choice of 
minimal and maximal distances ensures that the poly- 
mer never intersects itself. The model contains no energy 
scale, leading to an extremely simple Monte Carlo pro- 
cedure: An elementary move consists of an attempt to 
move a randomly selected monomer by one lattice spac- 
ing in an arbitrarily chosen direction. If the new config- 
uration is permitted, the step is executed; otherwise, the 
configuration remains unchanged. One MC time unit is 
composed of N elementary moves. This model closely 
resembles tethered spheres used in continuum [28| simu- 
lations. We previously used this model to demonstrate 
the anomalous dynamics of polymer translocation [l3T ]. 
The membrane intervening membrane in simulations has 
a thickness of two lattice constants, with a hole that is 
three lattice spacings wide. The tight size ensures that 
only one monomer can pass through the hole, and enables 
a unique designation of the monomer s which separates 
the polymer segments on the two sides of the membrane. 

The translocation process in actuality involves several 
complicating factors: The polymer located on one side 
of the membrane must first reach the pore such that one 
end enters the pore. In the absence of strong driving 
force, it is then quite likely that the polymer retracts 
and does not pass through to the other side until a num- 
ber of such attempts. Both of these processes have been 
discussed in the literature. Since we are only interest- 
ing in the anomalous dynamics during the translocation 
process, we implement a computation procedure that is 
different from the usual experimental conditions. In our 
simulations, an the initial configuration is constructed by 
fixing the monomer s — N/2 in the hole, and equilibrat- 
ing the remaining monomers for more than the relaxation 
time (which is proportional to TV 2 5 ) [Tl|. After this equi- 
libration is finished, at time t — 0, the fixed monomer is 
allowed to move freely. The simulation ends at time t = T 
when the entire polymer is on either side of the mem- 
brane. We denote T the translocation time. The proce- 
dure is repeated a large number of times for each polymer 
size N, to construct the probability Qn(T). For each 
simulation run, we also record the trajectory s(t) of the 
translocation coordinate. Consequently, we are able to 
monitor the evolution of the distribution P(s, t). At the 
starting moment P(s,0) = 5 St jsr/2, and it subsequently 
broadens as the time increases. It should be noted that as 
t reaches typical translocation times, the fraction of poly- 
mers that has completed the process starts to grow, and 
consequently the probability distribution P(s.t), which 
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FIG. 1: (Color online) Probability distribution of the translo- 
cation coordinate s, of a polymer with TV = 128 monomers for 
MC times t = 10 4 , 2 x 10 4 , ■ ■ • , and 9 x 10 4 (from narrowest to 
the widest distribution), which are significantly shorter than 
the mean translocation time. The results are obtained from 
10,000 independent runs. Continuous lines represent Gaus- 
sian fits to these distributions. 

is normalized for each t, is obtained from a decreasing 
sample of runs. Another drawback is that the simula- 
tion times increase as JV 3 - 5 , making it difficult to obtain 
good statistics in the interesting limit of large N. We 
performed simulations for N = 8, 16, . . . , 128, and 256. 
Most of the results presented in the paper correspond 
to N — 128, for which we are able to obtain sufficiently 
many samples. 

III. RESULTS 

Each of our simulations begins at s = N/2, and as 
the time t increases the distribution P(s, t) becomes 
broader. As long as t is significantly shorter than the 
mean translocation time (T), the distribution of s re- 
sembles a Gaussian, as can be seen in Fig. [1] and is very 
different from the shapes obtained from the solutions of 
FDE. The plots depict the behavior of s for N = 128, 
where (T) w 2.9 x 10 6 MC time units. Similar shapes 
were observed in the simulations of a tagged monomer in 
a phantom polymer (see Fig. 1 in Ref. [151]), where one 
can prove that the distributions are indeed Gaussian. In 
our case, there is no analytical proof, and we instead 
numerically examined the fourth cumulant K4 of the dis- 
tribution, which vanishes for a Gaussian PDF. Since the 
variable s is discrete, K4 does not vanish, but should be- 
come significantly smaller than the squared second cu- 
mulant (which is the variance of the distribution 6s 2 ), as 
the width of the distribution increases. We find that once 
this width exceeds 2, the cumulant K4 becomes only a few 
percent of (5s 2 ) 2 , and the deviation from zero is probably 



FIG. 2: (Color online) The mean squared displacement of 
the translocation coordinate s as a function of MC time t, 
obtained from 10 4 runs of a 128-monomer polymer. The 
straight line represents the power-law fit t a in the interval 
10 4 < t < 1.2 x 10 5 with exponent a = 0.86. 



caused by the statistical errors. Thus within our statisti- 
cal accuracy, the PDF at short times is indistinguishable 
from a Gaussian. 

As the time goes on, the variance 8s 2 is expected to 
increase as t a , with a w 2/(1 + 2i/) = 0.8, and even- 
tually saturate at values of order N 2 . Figure O depicts 
such dependence on a logarithmic scale. For a range of 
times longer than 10 4 this line has a straight segment 
with slope 0.86, which is slightly larger than the above 
value, and is consistent with the other numbers quoted in 
the literature [13J, l20j . The statistical accuracy of the cal- 
culated exponent is better than the last significant digit 
of the number. However, we believe that systematic er- 
rors related to crossovers and specific choice of the fitting 
range introduce significantly larger (few percent) errors. 
Curves for various values of N saturate approximately at 
5s 2 « (N/5) 2 , and corresponding saturation time T± is 
obtained by extrapolating the (low-t) power-law behav- 
ior to this value. For the various lengths N used in our 
simulations the ratio (T)/!Z\ is approximately constant. 

As the simulation time t exceeds (T), a significant 
fraction of polymers complete their translocation pro- 
cess. For the subsets of runs that survive into such 
long times, the distribution of P(s, t) reaches a stable 
shape. In the case of normal diffusion between two ab- 
sorbing boundaries, the limiting shape is a sine-function 
that vanishes linearly near the boundaries. This shape 
reflects the lowest eigenfunction of the diffusion opera- 
tor (Laplacian), corresponding to the longest decay time 
(eigenvalue). Since we do not know of a corresponding 
differential equation for the translocation coordinate, we 
do not know the corresponding limiting shape. Some in- 
sight into possible solutions can be gleaned by considering 
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FIG. 3: (Color online) Probability distribution of the translo- 
cation variable s of the subset of polymers with N = 128 
monomers that did not complete translocation at MC times 
t = 40 x 10 5 , 42 x 10 5 , • • • , and 48 x 10 5 (thin lines), which 
exceed the mean translocation time. These graphs were ob- 
tained by performing 10 4 independent runs out of which only 
15-20% survive to the times when the data is collected. The 
thick solid line is the average of 10 graphs in the range 
4 x 10 6 < t < 4.9 x 10 6 . The dashed line depicts the fit 
function Asm k (sir/(N + 1)) with k = 1.44. 



a fractional diffusion operator, as in the case of the Lapla- 
cian raised to the power l/a, with absorbing boundary 
conditions (see, e.g. Zoia et al. 29}). In the absence of 
absorbing boundaries the squared width of the distribu- 
tion produced by such a fractional Laplacian increases as 
t a . In the presence of the boundaries, the eigenstates are 
not known for general a. However, it is known that near 
the absorbing boundaries the eigenfunction goes to zero 
nonlinear ly, with an exponent of k = l/a. For normal 
diffusion, this naturally reduces to the expected linear 
form. For subdiffusion, the eigenvalues of this operator 
vanish faster than linearly; in the case a = 0.8 with ex- 
ponent k = 1.25. Figure [3] depicts P[s,t) for N = 128 
and for times exceeding (T) . The statistical accuracy of 
these results is not very good, since a significant fraction 
of the polymers have already translocated. The accu- 
racy is particularly poor near the endpoints of the graph 
where the probability approaches zero. Nevertheless, we 
observe that the function seems to decay faster than lin- 
early and slower than quadratically. The overall shape 
of the distribution can be approximated by the function 
Asm k (sn/(N + 1)). We find a good fit with k = 1.44. 
Qualitatively, Fig. [3] resembles the results obtained for 
tagged monomer diffusion (see Fig. 8 in Ref. [Hi]), but 
with a different exponent k. 

We also studied the probability distribution Q(T) of 
the translocation time. In the range from N = 8 to 
N = 256 we find that the mean translocation time (T) 
increases as TV 2 - 51 consistent with 1 + 2v = 2.5, and in 
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FIG. 4: (Color online) Probability distribution of transloca- 
tion time T for TV = 32, 64 and 128 (left to right), obtained 
from 100,000, 78,600 and 8,600 runs, respectively. The in- 
set demonstrates the collapse of the probabilities when T is 
scaled with its average value (T). 

accord with previous work [IH, [2(| • As in the case of the 
numerical estimate of a, the statistical errors are smaller 
than the last significant digit, but we should beware of 
systematic errors. For example, the exponent 2.51 should 
correspond to a ~ 0.80, which is smaller than the directly 
measured value of 0.86, and indicates the importance of 
systematic errors. Figure @] depicts Q(T) for three val- 
ues of JV on a semi-logarithmic scale. We clearly see 
an exponential decay Q(T) ~ exp(— T/T ) for large T in 
each of the graphs. The decay constant To increases with 
increasing N. The ratio (T)/Tq is approximately a con- 
stant. Moreover, in terms of rescaled times T' = T/(T) 
the distribution becomes independent of N, as can be 
seen in the inset in Fig. Q] 



IV. DISCUSSION AND CONCLUSION 

In this work we performed a detailed study of distribu- 
tion functions associated with the translocation of a 2D 
model of a self-avoiding polymers. Our results clearly in- 
dicated an exponential decay of the PDF Q(T) for large 
translocation times T, and thus exclude power-law [22| 
or stretched exponential [3fj| behavior. The distribution 
of the translocation variable s both at short and long 
times exhibits the behavior resembling that of a tagged 
monomer [l5j . There is some similarity in the behavior of 
the long-times stationary distributions from our simula- 
tions, and solutions of a fractional Laplacian with absorb- 
ing boundaries [2{|. However, the accuracy of our results 
precludes definitive statements regarding these long-time 
distributions. 

Strong crossover effects are present in translocation for 
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surprisingly high values of N. In fact, the values of var- the Israel Science Foundation (Y.K.). C.Ch. thanks the 
ious exponents reported in the literature differ beyond MIT-France Program, 
their nominal error bars. One may hope that for N as 
large as 1,000 this difficulty can be overcome. Unfortu- 
nately, accumulating very large statistics for such large 
N is currently beyond our ability. 
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